Analyzing Semantic Concept Patterns to Detect Academic Plagiarism
Authors
Abstract
Detecting academic plagiarism is a pressing problem, e.g., for educational and research institutions, funding agencies, and academic publishers. Existing plagiarism detection systems reliably identify (nearly) copied text, but often fail to detect disguised forms of academic plagiarism, such as paraphrases, translations, and idea plagiarism. We present Semantic Concept Pattern Analysis, an approach that performs an integrated analysis of semantic text relatedness and structural text similarity. Using 25 officially retracted cases of academic plagiarism, we demonstrate that our approach can detect cases that established text matching approaches would not identify. We see the approach as a promising addition to improve the detection capabilities for strong paraphrases. We plan to further improve Semantic Concept Pattern Analysis and include the approach as part of an integrated detection process that analyzes heterogeneous similarity features to better identify the many possible forms of plagiarism in academic documents.
Similar resources
English-Persian Plagiarism Detection based on a Semantic Approach
Plagiarism, which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them”, poses a major challenge to the spread and publication of knowledge. Plagiarism falls into four categories: direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism which is sometimes referred to as cross-li...
Identifying free text plagiarism based on semantic similarity
It is common knowledge that plagiarism in academia goes as far back in time as research itself. However, in the last two decades this phenomenon of academic deception has turned into an academic plague. Undoubtedly, the rapid expansion of the Web and the vast amount of publicly available information and documents facilitate the unethical malpractice of computer-aided plagiarism, which in turn has i...
Fuzzy-Semantic Similarity for Automatic Multilingual Plagiarism Detection
A word may have multiple meanings or senses. This can be modeled by treating each word in a sentence as having a fuzzy set that contains words with similar meaning. Such ambiguity makes detecting plagiarism a hard task, especially when dealing with semantic meaning, and even harder for cross-language plagiarism detection. Arabic is known for its richness and for the diversity of its word constructions and meanings, hence c...
Web-based Demonstration of Semantic Similarity Detection Using Citation Pattern Visualization for a Cross Language Plagiarism Case
In a previous paper, we showed that analyzing citation patterns in the well-known plagiarized thesis by K. T. zu Guttenberg clearly outperformed current detection methods in identifying cross-language plagiarism. However, the experiment was a proof of concept and we did not provide a prototype. This paper presents a fully functional, web-based visualization of citation patterns for this verifie...
Analyzing Similarity in Mathematical Content To Enhance the Detection of Academic Plagiarism
Despite the effort put into the detection of academic plagiarism, it continues to be a ubiquitous problem spanning all disciplines. Various tools have been developed to assist human inspectors by automatically identifying suspicious documents. However, to our knowledge currently none of these tools use mathematical content for their analysis. This is problematic, because mathematical content po...